Abstract

The properties of an alternative definition of quantum entropy, based on Wigner functions,
are discussed. Such a definition emerges naturally from the Wigner representation of
quantum mechanics, and can easily quantify the amount of entanglement of a quantum
state. It is shown that smoothing of the Wigner function induces an increase in entropy.
This fact is used to derive some simple rules to construct positive definite probability
distributions which are also admissible Wigner functions.
I. Introduction



Entropy is the central concept of thermodynamics and statistical mechanics. It was introduced
by Clausius in the mid-19th century as a phenomenological variable that quantifies
the intrinsic irreversibility of thermodynamical processes. It was Boltzmann who recognized
the link between entropy and the lack of information about a system, defined as the number
$\Gamma$ of microstates which have the same macroscopic properties. The celebrated formula

$$S_B = k_B \ln \Gamma \qquad (1)$$

establishes such a link in a mathematically rigorous manner (in the rest of this article we shall
use units for which $k_B = 1$: with this prescription, entropy becomes a dimensionless quantity).
Boltzmann, of course, derived this formula in the context of classical statistical mechanics.
In classical physics, microstates are defined as points in a continuous 2D-dimensional phase
space (D is the number of degrees of freedom of the system under consideration), and cannot
be "counted" in any meaningful sense. Therefore, Boltzmann took as the number $\Gamma$ of
microstates the available volume in phase space $\Omega$ divided by the volume of a unit cell
(unspecified at the time when Boltzmann published his work, but which would turn out to be
Planck's constant, raised to the appropriate power, $h^D$): $\Gamma = \Omega/h^D$. In quantum
mechanics, a microstate is described by a wave function, which contains all the information
about the state of the system. In contrast to the classical case, now there is no ambiguity,
since quantum states are discrete in principle. Hence, although the macrostate has a huge
number of possible microstates consistent with it, this number, $\Gamma$, is nevertheless
definite and finite.

The most general quantum system is described by a density matrix, i.e. a positive-definite,
Hermitian operator, with unit trace. In terms of the density matrix $\rho$, the entropy can be
expressed in the following way, due to Von Neumann

$$S_{VN} = -\mathrm{Tr}\,\rho \ln \rho . \qquad (2)$$

This is the standard definition of entropy, which generalizes Boltzmann's expression to quantum
mechanics. Although unambiguously defined, $S_{VN}$ can be extremely difficult
to compute in practice, since one would need to diagonalize $\rho$ in order to compute the trace
of its logarithm. Von Neumann's entropy (VN) has a number of good properties, which will
be detailed in the following sections. Here we note that, if $\alpha_i \ge 0$ are the eigenvalues
of the density matrix ($\sum_i \alpha_i = 1$), the VN entropy becomes
$S_{VN} = -\sum_i \alpha_i \ln \alpha_i$. Therefore $S_{VN} \ge 0$, and the equality holds only if
we have complete information, i.e. if only one of the eigenvalues is different from zero: in this
case, the system is in the pure state corresponding to this eigenvalue. Another crucial property
of $S_{VN}$ is that it is conserved as $\rho$ evolves according to the quantum Liouville equation

$$i\hbar \frac{\partial \rho}{\partial t} = H\rho - \rho H , \qquad (3)$$

where H is the Hamiltonian. Indeed, the trace of any functional F of the density matrix,
Tr $F(\rho)$, is also conserved. This fact can be used to define other entropy-like quantities. Not
all these quantities are equivalent, however, and we will show in the following section that only
one of them is particularly adapted to the Wigner representation of quantum mechanics.
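
The conservation of Tr $F(\rho)$ can be illustrated numerically. The following is a minimal sketch, not taken from the original derivation: it assumes a hypothetical 3-level system, a randomly generated Hermitian matrix standing in for H, and units with $\hbar = 1$. It evolves $\rho$ with the unitary solution of Eq. (3) and compares the Von Neumann entropy and the quantity $1 - \mathrm{Tr}\,\rho^2$ before and after the evolution.

```python
import numpy as np

def von_neumann_entropy(rho):
    """S_VN = -sum_i a_i ln a_i over the eigenvalues a_i of rho (k_B = 1)."""
    a = np.linalg.eigvalsh(rho)
    a = a[a > 1e-12]                      # the term 0 ln 0 is taken as 0
    return float(-np.sum(a * np.log(a)))

def linear_entropy(rho):
    """Another quantity of the form Tr F(rho): 1 - Tr rho^2."""
    return float(1.0 - np.trace(rho @ rho).real)

# Illustrative mixed state with eigenvalues (0.5, 0.3, 0.2) (assumed values).
rho0 = np.diag([0.5, 0.3, 0.2]).astype(complex)

# Hypothetical Hermitian Hamiltonian, used only to generate a unitary evolution.
rng = np.random.default_rng(0)
A = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
H = (A + A.conj().T) / 2
w, V = np.linalg.eigh(H)
t = 1.7
U = V @ np.diag(np.exp(-1j * w * t)) @ V.conj().T   # U = exp(-i H t), hbar = 1

rho_t = U @ rho0 @ U.conj().T             # solves i d(rho)/dt = H rho - rho H

S0, St = von_neumann_entropy(rho0), von_neumann_entropy(rho_t)
L0, Lt = linear_entropy(rho0), linear_entropy(rho_t)

pure = np.zeros((3, 3), dtype=complex)
pure[0, 0] = 1.0                          # a pure state: complete information
S_pure = von_neumann_entropy(pure)
print(S0, St, L0, Lt, S_pure)
```

Both quantities are unchanged by the evolution up to rounding errors, and the pure state has zero entropy, as stated in the text.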

The classical limit of the Von Neumann entropy, Eq. (2), is obtained by replacing the density
matrix with the phase space probability distribution $f(x,p)$ (for simplicity, we will consider
systems with only one degree of freedom, D = 1), and the trace with the integral in phase
space. One obtains the following expression, due to Gibbs

$$S_{CL} = -\int f \ln(f h)\, dx\, dp , \qquad (4)$$

and the probability distribution is positive and normalized to unity. Note that the classical
entropy is defined up to an additive constant, which means that the constant h in the argument
of the logarithm in Eq. (4) can be chosen arbitrarily, although it seems reasonable to
use Planck's constant $h = 2\pi\hbar$. Indeed, if $f$ is constant inside a certain phase space volume
$\Omega$ and zero elsewhere (i.e. at thermodynamic equilibrium), then $S_{CL} = \ln(\Omega/h)$, in
agreement with Boltzmann's original definition, Eq. (1). We also stress that $S_{CL}$ can take
negative values, in contrast with $S_{VN}$, which is always non-negative. From the previous
discussion, it is easy to conclude that $S_{CL}$ will be negative when $\Omega < h$. This means that
we are trying to localize a particle on a phase space region smaller than Planck's constant, and
therefore violate the uncertainty principle. For probability distributions that satisfy the
uncertainty principle, the classical entropy is positive. Similarly to the quantum mechanical
case, the classical entropy is conserved for a Hamiltonian process, i.e. when the probability
distribution evolves according to the classical Liouville equation. Again, the phase space
integral of any functional $F(f)$ is also conserved (indeed, $f$ itself is conserved, since it is just
transported along the classical trajectories).

In this paper, we discuss the properties of an alternative definition of quantum entropy, based 
on Wigner functions. Although this entropy has already been known for some time (generally 
expressed in terms of the density matrix), we feel that its properties are not fully appreciated. 
In particular, it will be shown that such a definition of entropy emerges naturally from the 
Wigner representation of quantum mechanics. It has therefore a privileged status compared 
to the many other definitions proposed in the literature, and deserves to be studied in some 
depth. 

The Wigner representation [?] is a useful tool to express quantum mechanics in a phase space
formalism (for reviews see [?]). Although it was derived by Wigner for technical purposes,
this approach has recently attracted much interest, since it is well suited to analyzing the
transition from classical to quantum dynamics. The Wigner representation can deal with
both pure and mixed quantum states, and is completely equivalent to the more usual picture
based on the density matrix. In this representation, a quantum state is described by a Wigner
function (i.e. a function of the phase space variables, see next Section), and the Wigner
equation provides an evolution equation for the state which is equivalent to the quantum
Liouville equation (3). It will be shown that, if one tries to define an entropy functional in
the framework of Wigner's representation, only one 'reasonable' choice is possible, and this is
discussed in the next Section. Subsequently, we will discuss the properties of such an entropy
(Sec. III), and present some examples of its applications in Secs. IV and V.

II. Quantum Entropy 

The quantum distribution function $W(x,p)$ is defined in terms of the density matrix $\rho(x,y)$
for a quantum mixed state

$$W(x,p) = \frac{1}{2\pi\hbar} \int \rho\left(x - \frac{\lambda}{2}, x + \frac{\lambda}{2}\right) \exp\left(\frac{i p \lambda}{\hbar}\right) d\lambda , \qquad (5)$$

or in terms of the wavefunction $\psi(x)$ for a pure state

$$W(x,p) = \frac{1}{2\pi\hbar} \int \psi\left(x - \frac{\lambda}{2}\right) \psi^*\left(x + \frac{\lambda}{2}\right) \exp\left(\frac{i p \lambda}{\hbar}\right) d\lambda . \qquad (6)$$

The function $W(x,p)$ possesses many of the properties of a phase space probability distribution:
it is real, normalized to unity, and, when integrated over x or p, gives the correct
marginal distribution, e.g. $\int W\,dp = \rho(x,x)$ = spatial density. Furthermore, it can be used to
compute averages of any dynamical variable $A(x,p)$: $\langle A \rangle = \int W A\, dx\, dp$. Note however
that, since some terms in $A(x,p)$ may not commute, it is necessary to establish a non-ambiguous
correspondence between classical variables and quantum operators (Weyl's rule). Despite
these good properties, the Wigner function cannot be interpreted as a probability distribution,
since it can assume negative values. The only pure state whose Wigner function is
positive definite is given by the minimum uncertainty packet (i.e. a Gaussian wavefunction).
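
To make these properties concrete, Eq. (6) can be discretized directly. The sketch below is only illustrative and rests on assumptions that are not in the text: units with $\hbar = 1$, a Gaussian of width `SIGMA` chosen arbitrarily, an odd test state `psi_odd` (x times the Gaussian), and a plain Riemann sum over $\lambda$ on hand-picked grids. It checks the normalization, the positivity of the Gaussian case, the marginal $\int W\,dp = |\psi(x)|^2$, and the negative value at the origin for the odd state.

```python
import numpy as np

HBAR = 1.0  # units with hbar = 1 (assumption of this sketch)

def wigner(psi, x, p, lam):
    """Riemann-sum discretization of Eq. (6); psi is a callable wavefunction."""
    dlam = lam[1] - lam[0]
    # corr[j, l] = psi(x_j - lam_l/2) * conj(psi(x_j + lam_l/2))
    corr = psi(x[:, None] - lam[None, :] / 2) * np.conj(psi(x[:, None] + lam[None, :] / 2))
    phase = np.exp(1j * np.outer(lam, p) / HBAR)       # exp(i p lam / hbar)
    return ((corr @ phase) * dlam / (2 * np.pi * HBAR)).real

SIGMA = 1 / np.sqrt(2)  # illustrative width of the Gaussian packet

def psi_gauss(x):
    """Minimum uncertainty Gaussian wavefunction."""
    return (2 * np.pi) ** -0.25 * SIGMA ** -0.5 * np.exp(-x**2 / (4 * SIGMA**2))

def psi_odd(x):
    """A normalized odd state (x times the Gaussian), chosen to show negativity."""
    return (x / SIGMA) * psi_gauss(x)

x = np.linspace(-6, 6, 121)
p = np.linspace(-6, 6, 121)
lam = np.linspace(-12, 12, 481)
dx, dp = x[1] - x[0], p[1] - p[0]

W_gauss = wigner(psi_gauss, x, p, lam)
W_odd = wigner(psi_odd, x, p, lam)

norm_gauss = W_gauss.sum() * dx * dp      # total probability
norm_odd = W_odd.sum() * dx * dp
marginal_odd = W_odd.sum(axis=1) * dp     # integral over p at each x
W_odd_origin = W_odd[60, 60]              # grid point x = p = 0
print(norm_gauss, W_gauss.min(), W_odd_origin)
```

With these choices the Gaussian Wigner function is non-negative on the whole grid, whereas the odd state takes the value $-1/\pi$ at the origin.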

The evolution of $W(x,p,t)$ is governed by the Wigner equation, which replaces the classical
Liouville equation:

$$\frac{\partial W}{\partial t} + \frac{p}{m}\frac{\partial W}{\partial x} = \frac{i}{2\pi\hbar^2} \iint \left[\Phi\left(x - \frac{z}{2}\right) - \Phi\left(x + \frac{z}{2}\right)\right] \exp\left[-\frac{i}{\hbar}(p - p')z\right] W(x,p',t)\, dz\, dp' , \qquad (7)$$

where $\Phi(x)$ is the potential. The Wigner equation is equivalent to the quantum Liouville
equation (3), and can describe the evolution of both pure states and mixtures. However,
in the present work, we shall favor the Wigner formalism over the density matrix one,
since it is easier to represent in the classical phase space, and it allows a more straightforward
treatment of the semi-classical limit.

We would like to define an entropy functional in terms of Wigner functions. The classical
choice, Eq. (4), obviously cannot work, since W can assume negative values. It is easy to
show the existence of two simple functionals of W that are invariant under Eq. (7): the first is
the total probability $\int W\, dx\, dp = 1$; the second invariant is $\int W^2\, dx\, dp$, which has no
obvious physical meaning. We stress that this is a property of Eq. (7), and does not depend on
whether W represents a pure state, a mixture, or even a state which violates the uncertainty
principle. However, the fact that the latter expression is indeed invariant suggests that we
introduce the following definition of entropy

$$S_2 = 1 - (2\pi\hbar)^D \int W^2\, dx\, dp , \qquad (8)$$

where D is the number of degrees of freedom: except where otherwise stated, we will always
work with systems for which D = 1.

The $S_2$ entropy can be expressed in terms of the density matrix $\rho$

$$S_2 = 1 - \mathrm{Tr}\,\rho^2 , \qquad (9)$$

a result which follows from the fact that W is related to the Fourier transform of $\rho$. Equation
(9) has been used in the literature as an entropy-like quantity and is sometimes referred to
as the linear entropy. Its relevance to Wigner functions has been noticed by some authors
[?], but its full implications have not, to our knowledge, been appreciated and developed.
We first notice that this is the only expression of entropy having the same functional form
when expressed in terms of either W or $\rho$ (for example, $\int W^4$ is not simply related to
Tr $\rho^4$). Secondly, and most importantly, the very structure of Wigner's equation selects the
functional $S_2$ as a special candidate for a definition of entropy. It is therefore important to
study its properties and implications.

When W is an admissible Wigner function (i.e. when it represents either a pure or a mixed
quantum state), the previous entropy satisfies the relation $0 \le S_2 \le 1$, and $S_2 = 0$ holds for a
pure state, which is a reasonable result, since pure states contain the maximum information
available. Indeed, it is possible to define quantum information as the complement of $S_2$ to
unity, $I = 1 - S_2$. Note that $S_2$ can become negative only for states that violate the uncertainty
principle, as will be explained in Sec. III. We point out that $S_2 = 0$ is a necessary, but
definitely not sufficient, condition for the corresponding Wigner function to represent a pure
state [?]. This can be shown by finding a counter-example. Let us define the Wigner function
as $W = \sum_{i=1}^{3} \alpha_i W_i$, where the $W_i$ are orthogonal pure states, and
$\alpha_1 = \alpha_2 = 2/3$, $\alpha_3 = -1/3$. Even though the coefficients $\alpha_i$ sum up to unity, W
does not represent an admissible Wigner function, since one of the coefficients (which represent
probabilities) is negative. However, it is simple to prove that $S_2[W] = 0$. Incidentally, this
example has shown the existence of phase space functions which represent neither pure states
nor mixtures. This point will be discussed in more detail in the next Section.
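
The counter-example can be checked on a grid. The sketch below assumes, purely for illustration, that the three orthogonal pure states are the first three harmonic oscillator eigenstates in units $\hbar = m = \omega = 1$, whose Wigner functions have the standard closed form $W_n = [(-1)^n/\pi]\, e^{-(x^2+p^2)} L_n(2x^2 + 2p^2)$ with $L_n$ a Laguerre polynomial; the helper name `wigner_ho` and the grid are hypothetical choices. It evaluates $S_2$ from Eq. (8) and recovers each coefficient from the overlap $2\pi\hbar \int W W_i\, dx\, dp$, which follows from the orthogonality relation given later as Eq. (12).

```python
import numpy as np
from numpy.polynomial.laguerre import lagval

def wigner_ho(n, X, P):
    """Wigner function of the n-th oscillator eigenstate (hbar = m = omega = 1)."""
    r2 = X**2 + P**2
    coeffs = np.zeros(n + 1)
    coeffs[n] = 1.0                       # selects the Laguerre polynomial L_n
    return (-1) ** n / np.pi * np.exp(-r2) * lagval(2 * r2, coeffs)

x = np.linspace(-8, 8, 321)
X, P = np.meshgrid(x, x, indexing="ij")
dA = (x[1] - x[0]) ** 2                   # phase space area element dx dp

alphas = np.array([2 / 3, 2 / 3, -1 / 3])  # coefficients of the counter-example
states = [wigner_ho(n, X, P) for n in range(3)]
W = sum(a * Wn for a, Wn in zip(alphas, states))

total = W.sum() * dA                                   # normalization of W
S2 = 1 - 2 * np.pi * (W**2).sum() * dA                 # Eq. (8) with D = 1
recovered = np.array([2 * np.pi * (W * Wn).sum() * dA for Wn in states])
print(total, S2, recovered)
```

The function is normalized and has $S_2 = 0$ to numerical accuracy, yet the third recovered coefficient is $-1/3$, so W cannot describe a physical mixture.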
This entropy is related to a formula proposed by Tsallis [7], which has stimulated much work
in the last decade (see, for example, [?] and references therein). If $\{\alpha_i\}$ is a set of
probabilities adding up to unity, Tsallis entropy is defined by

$$S_q = \frac{1 - \sum_i \alpha_i^q}{q - 1} , \qquad (10)$$

where q is a real, not necessarily positive, number, and the standard entropy is recovered for
$q \to 1$. Tsallis entropy is a possible, and indeed useful, way to generalize the Boltzmann-Von
Neumann expression, and has been employed by several authors to study the thermodynamics
of strongly correlated systems, such as self-gravitating gases and inviscid fluids [8].
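
As a quick numerical illustration of Eq. (10), the sketch below evaluates the Tsallis entropy for an arbitrarily chosen probability vector and checks the two cases used in this paper: q = 2, which gives $1 - \sum_i \alpha_i^2$, and the limit $q \to 1$, which approaches $-\sum_i \alpha_i \ln \alpha_i$. The function name `tsallis` and the sample probabilities are assumptions made for the example.

```python
import numpy as np

def tsallis(alpha, q):
    """Tsallis entropy, Eq. (10); q = 1 is handled as the Boltzmann-Von Neumann limit."""
    alpha = np.asarray(alpha, dtype=float)
    if np.isclose(q, 1.0):
        a = alpha[alpha > 0]              # the term 0 ln 0 is taken as 0
        return float(-np.sum(a * np.log(a)))
    return float((1.0 - np.sum(alpha**q)) / (q - 1.0))

alpha = [0.5, 0.3, 0.2]                   # illustrative probabilities

S_q2 = tsallis(alpha, 2.0)                # quadratic case: 1 - sum(alpha^2)
S_limit = tsallis(alpha, 1.0)             # standard entropy
S_near = tsallis(alpha, 1.0 + 1e-6)       # q close to 1
print(S_q2, S_limit, S_near)
```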

Equation (8) is the continuous counterpart of the discrete Tsallis entropy with q = 2. The
continuous formula can be recovered by the following heuristic argument. Let us cover the
phase space with cells of size $\Delta x \Delta p$. The discrete probabilities are then
$\alpha_i = W(x_i,p_i)\Delta x \Delta p$, and the discrete entropy becomes

$$S_2 = 1 - \Delta x \Delta p \sum_i W^2(x_i,p_i)\, \Delta x \Delta p . \qquad (11)$$

The sum in Eq. (11) gives the integral $\int W^2\, dx\, dp$. However, we cannot let the factor
$\Delta x \Delta p$ in front of the sum go to zero, since this would violate the uncertainty relation.
Indeed, we obtain the correct continuous formula [Eq. (8) with D = 1] by taking for
$\Delta x \Delta p$ the smallest value allowed by quantum mechanics, i.e. Planck's constant
$h = 2\pi\hbar$.

Another way to go from the continuous to the discrete formula is to consider a Wigner
function that is the sum of N orthogonal pure states, $W(x,p) = \sum_{i=1}^{N} \alpha_i W_i(x,p)$. Of
course W represents a quantum mixture. We recall the following useful relation, valid for
orthogonal pure states:

$$\int W_i W_j\, dx\, dp = \frac{\delta_{ij}}{2\pi\hbar} , \qquad (12)$$

where $\delta_{ij}$ is the Kronecker delta. By developing W in terms of the $W_i$ in Eq. (8), and making
use of Eq. (12), we obtain Tsallis discrete entropy $S_2 = 1 - \sum_{i=1}^{N} \alpha_i^2$. We stress again
that the above properties are valid for the quadratic entropy $S_2$, but do not hold for other
functionals involving higher powers of W.

It is interesting to show that a local entropy $\sigma$ and an entropy flux $J_S$ can also be defined:

$$\sigma(x,t) = \int W\, dp - 2\pi\hbar \int W^2\, dp , \qquad J_S(x,t) = \int \frac{p}{m} W\, dp - 2\pi\hbar \int \frac{p}{m} W^2\, dp . \qquad (13)$$

Of course one has $S_2 = \int \sigma\, dx$. By multiplying Eq. (7) by W and integrating over momentum
space, one can prove that the local entropy obeys a continuity equation:

$$\frac{\partial \sigma}{\partial t} + \frac{\partial J_S}{\partial x} = 0 , \qquad (14)$$

which shows that entropy can be transferred from one spatial location to another, but is
globally conserved. The physical meaning of $\sigma$ is easier to grasp if we express it in terms of
the density matrix in the position representation. With the help of Eq. (5) one finds (we
drop the time dependence)

$$\sigma(x) = \rho(x,x) - \int \left| \rho\left(x - \frac{\lambda}{2}, x + \frac{\lambda}{2}\right) \right|^2 d\lambda . \qquad (15)$$
Equation (15) shows that entropy is closely related to the off-diagonal terms of the density
matrix. For a pure state, $\rho(x,y) = \psi(x)\psi^*(y)$ ($\psi$ is the wavefunction), and the local entropy
can be expressed in terms of the spatial density $n(x) = |\psi(x)|^2 = \rho(x,x)$

$$\sigma(x) = n(x) - \int n\left(x - \frac{\lambda}{2}\right) n\left(x + \frac{\lambda}{2}\right) d\lambda = n(x) - \iota(x) , \qquad (16)$$

where we have defined the local quantum information $\iota(x)$ so that $I = \int \iota\, dx$. It appears
that $\iota(x)$ is a density autocorrelation function, which shows that, in quantum mechanics,
information and spatial correlations are intimately related concepts.
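
Equation (16) is easy to evaluate for a simple case. The sketch below assumes a Gaussian density of unit variance, chosen only for illustration, and approximates the autocorrelation integral by a Riemann sum on a hand-picked grid. For this density the integral can be done analytically, giving $\iota(x) = \pi^{-1/2} e^{-x^2}$, so the numerical result can be compared with a closed form.

```python
import numpy as np

def density(x):
    """Spatial density n(x) of a Gaussian pure state with unit variance (assumed)."""
    return np.exp(-x**2 / 2) / np.sqrt(2 * np.pi)

def local_information(n, x, lam):
    """iota(x) = integral of n(x - lam/2) n(x + lam/2) d(lam), as in Eq. (16)."""
    dlam = lam[1] - lam[0]
    X, L = x[:, None], lam[None, :]
    return np.sum(n(X - L / 2) * n(X + L / 2), axis=1) * dlam

x = np.linspace(-8, 8, 321)
lam = np.linspace(-16, 16, 641)
dx = x[1] - x[0]

iota = local_information(density, x, lam)
sigma_loc = density(x) - iota             # local entropy sigma(x)

I_total = iota.sum() * dx                 # quantum information I
S2_total = sigma_loc.sum() * dx           # global entropy S_2
sigma_center = sigma_loc[160]             # value at x = 0
print(I_total, S2_total, sigma_center)
```

The local entropy is negative near the center of the packet and positive in the tails, while its integral vanishes, as expected for a pure state.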



III. Properties of Quantum Entropy 

The expression given in Eq. (8) has proven to be a fruitful tool to quantify some key properties
of quantum systems, such as nonlocal correlations. In order to be an appropriate definition
of entropy it should nevertheless satisfy some standard properties [?], among which concavity
and additivity are particularly fundamental. Some of these properties were previously studied
by Tsallis [7] for the discrete case.

1. Concavity. This means that, if $W = \sum_{i=1}^{N} \alpha_i W_i$ (where the $W_i$ are not necessarily
pure orthogonal states), then the following inequality holds

$$S_2[W] \ge \sum_{i=1}^{N} \alpha_i S_2[W_i] . \qquad (17)$$

The proof is obtained by direct calculation for N = 2, and then is easily extended to higher
N by recursive arguments.

Note that we can also prove an upper bound for $S_2$

$$S_2[W] \le \sum_{i=1}^{N} \alpha_i^2 S_2[W_i] + 1 - \sum_{i=1}^{N} \alpha_i^2 , \qquad (18)$$

which holds for $W_i$ representing either pure states or mixtures. The term $1 - \sum_i \alpha_i^2$
represents the so-called mixing entropy. The proof of Eq. (18) relies on the following
inequality [?]

$$\int W_i W_j\, dx\, dp \ge 0 , \qquad (19)$$

which is valid for all admissible Wigner functions, pure or mixed states (see Sec. IV for a
definition of admissibility). When the $W_i$ represent pure states, then $S_2[W_i] = 0$, and Eq.
(18) becomes

$$S_2[W] \le 1 - \sum_{i=1}^{N} \alpha_i^2 . \qquad (20)$$

The equality sign holds when the $W_i$ are also orthogonal, as was shown in Sec. II.
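
The two bounds (17) and (18) can be probed numerically in the density matrix form of Eq. (9). The sketch below is a spot check under stated assumptions, not a proof: it uses randomly generated finite-dimensional density matrices (a hypothetical stand-in for admissible Wigner functions), random weights $\alpha_i$ summing to unity, and the helper names `random_density_matrix`, `S2` and `check_bounds`, none of which appear in the original text.

```python
import numpy as np

rng = np.random.default_rng(1)

def random_density_matrix(dim, rank):
    """Random positive, Hermitian, unit-trace matrix of the given rank."""
    A = rng.normal(size=(dim, rank)) + 1j * rng.normal(size=(dim, rank))
    rho = A @ A.conj().T
    return rho / np.trace(rho).real

def S2(rho):
    """Quadratic entropy in the form of Eq. (9): 1 - Tr rho^2."""
    return float(1.0 - np.trace(rho @ rho).real)

def check_bounds(n_states=4, dim=6):
    """Return (lower bound of Eq. 17, S2 of the mixture, upper bound of Eq. 18)."""
    rhos = [random_density_matrix(dim, rng.integers(1, dim + 1)) for _ in range(n_states)]
    alpha = rng.dirichlet(np.ones(n_states))          # positive weights, sum = 1
    mix = sum(a * r for a, r in zip(alpha, rhos))
    lower = sum(a * S2(r) for a, r in zip(alpha, rhos))
    upper = sum(a**2 * S2(r) for a, r in zip(alpha, rhos)) + 1 - np.sum(alpha**2)
    return lower, S2(mix), upper

results = [check_bounds() for _ in range(200)]
all_ok = all(lo - 1e-12 <= s <= up + 1e-12 for lo, s, up in results)
print(all_ok)
```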



2. Additivity. Let us consider two independent subsystems A and B. The Wigner function
W describing the total system $A \cup B$ is simply given by the product of the Wigner functions
$W_A$ and $W_B$ for the two subsystems

$$W(x_A,p_A,x_B,p_B) = W_A(x_A,p_A)\, W_B(x_B,p_B) . \qquad (21)$$

It is easy to show that both the classical entropy, Eq. (4), and the Von Neumann entropy, Eq.
(2), are additive [?], i.e. $S[W] = S[W_A] + S[W_B]$. This is a key property, since it enables one
to identify the statistical entropy with the thermodynamical entropy, which is also additive.

By contrast, our definition of entropy is not additive in the usual sense. Let us first notice
that, whereas the number of degrees of freedom of each subsystem is D = 1, the total system
has D = 2. Therefore the information is defined as $I[W_{A,B}] = h \int W_{A,B}^2$ for each
subsystem, and $I[W] = h^2 \int W^2$ for the total system. With this in mind, it is easy to establish
the following expression for the quantum information

$$I[W] = I[W_A]\, I[W_B] , \qquad (22)$$

which shows that, since $I \le 1$, the information contained in the total system is smaller than
the information of each subsystem, except for pure states, for which I = 1. In terms of the
entropy $S_2 = 1 - I$, Eq. (22) becomes

$$S_2[W] = S_2[W_A] + S_2[W_B] - S_2[W_A]\, S_2[W_B] . \qquad (23)$$

The total entropy is therefore smaller than the sum of the partial entropies, but larger than
each of them. Note that when the subsystems are "almost pure" quantum states, then
$S_2[W_{A,B}] \ll 1$, and the non-additive correction in Eq. (23) becomes of higher order. In this
case, approximate additivity is recovered.
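
Equations (22) and (23) can be verified in the density matrix form of Eq. (9), where the product state of two independent subsystems is a Kronecker product. The sketch below assumes two small diagonal density matrices with arbitrarily chosen eigenvalues; the finite-dimensional setting is an illustrative substitute for the phase space integrals, in which the factor $h^D$ plays the role of the trace normalization.

```python
import numpy as np

def S2(rho):
    """Quadratic entropy, Eq. (9): 1 - Tr rho^2."""
    return float(1.0 - np.trace(rho @ rho).real)

# Two independent subsystems (illustrative eigenvalues).
rho_A = np.diag([0.7, 0.3])
rho_B = np.diag([0.5, 0.25, 0.25])

rho_AB = np.kron(rho_A, rho_B)            # product state of the total system

I_A, I_B, I_AB = 1 - S2(rho_A), 1 - S2(rho_B), 1 - S2(rho_AB)
lhs = S2(rho_AB)
rhs = S2(rho_A) + S2(rho_B) - S2(rho_A) * S2(rho_B)   # right-hand side of Eq. (23)
print(I_AB, I_A * I_B, lhs, rhs)
```

For these values $S_2[W_A] = 0.42$ and $S_2[W_B] = 0.625$, so the total entropy 0.7825 is smaller than their sum but larger than either of them.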



It is also interesting to note that Eq. (23) is formally identical to the expression for the
probability of the union of two subsets A and B, which reads

$$\mathrm{prob}(A \cup B) = \mathrm{prob}(A) + \mathrm{prob}(B) - \mathrm{prob}(A \cap B) , \qquad (24)$$

and $\mathrm{prob}(A \cap B) = \mathrm{prob}(A)\, \mathrm{prob}(B)$ for statistically independent systems. The analogy
of $S_2$ as probability is also consistent with the normalization $0 \le S_2 \le 1$.

3. Subadditivity. If the subsystems A and B are not independent, the Wigner function
cannot be factored as in Eq. (21). The Wigner function of each subsystem is then defined
by integrating over the other system's variables, for instance

$$W_A(x_A,p_A) = \int W(x_A,p_A,x_B,p_B)\, dx_B\, dp_B , \qquad (25)$$

and similarly for $W_B$. For the Boltzmann-Von Neumann entropy, one can prove that
$S[W] \le S[W_A] + S[W_B]$, and the equality sign holds when the two subsystems are
independent [?]. This means that the total system $A \cup B$ contains more information than the
sum of its parts, which is natural, since the two subsystems are correlated. However, no such
relation can be proven for $S_2$: this entropy is therefore not subadditive. Note that this fact is
consistent with the analogy of $S_2$ as probability given by Eq. (24). Indeed, when the subsets A
and B are not independent, the probability of their intersection $\mathrm{prob}(A \cap B)$ can be either
smaller or larger than the product $\mathrm{prob}(A)\, \mathrm{prob}(B)$, corresponding to either negative or
positive correlation.

4. Microcanonical Ensemble. We want to extremize the entropy $S_2$ with the constraint
$\int W\, dx\, dp = 1$. Using Lagrange multipliers, it is easy to show that the entropy is maximum
when $W = \mathrm{const.} = \Omega^{-1}$ within a phase space region of volume (area) equal to $\Omega$, and
W = 0 elsewhere. In this case the entropy is

$$S_2 = 1 - \frac{h}{\Omega} , \qquad (h = 2\pi\hbar) . \qquad (26)$$

This is the analog of Boltzmann's formula, Eq. (1), when the appropriate additive constant
is used, i.e. $S_B = \ln(\Omega/h)$. For both expressions, S = 0 when $\Omega = h$ (minimum uncertainty),
and the entropy becomes negative when $\Omega < h$, i.e. when the uncertainty relation is violated.
In the limit $\Omega \to \infty$, $S_2$ is bounded, and tends to unity (least information). With this
notation, the information $I = 1 - S_2$ is just the inverse of the number of available microstates
$\Omega/h$.

5. Canonical Ensemble. We now extremize $S_2$ with the constraints $\int W\, dx\, dp = 1$ and
$\int W E\, dx\, dp = U$, where $E(x,p) = p^2/2m + \Phi(x)$, and U is the average energy. Again using
Lagrange multipliers, we find the following equilibrium distribution

$$W_{eq}(x,p) = Z^{-1}\,[1 - \beta E(x,p)] , \quad \beta E \le 1 ; \qquad W_{eq}(x,p) = 0 , \quad \beta E > 1 , \qquad (27)$$

where $\beta$ is the Lagrange multiplier corresponding to the energy constraint, and can be
interpreted in the usual fashion as the inverse temperature $\beta = 1/T$; Z is a normalization
constant. For energies such that $\beta E \ll 1$, Eq. (27) reduces to the standard
exponential Boltzmann factor $\exp(-\beta E)$. Since $W_{eq}$ is a linear function of the energy, we
have been forced to introduce a cut-off, otherwise $W_{eq}$ would diverge for large values of E.
Physically, this means that states with energy E > T are forbidden at equilibrium. Note the
difference with standard thermodynamics, where such states are highly improbable (because
Boltzmann's factor decreases exponentially), but not forbidden in principle.

An interesting fact is that Eq. (27) is a stationary solution of the Wigner equation (7);
indeed, we are aware of no other stationary solution which is also a function of the energy
$E(x,p)$ alone. This is easy to prove when the right-hand side of Eq. (7) is written as

$$\sum_{n=0}^{\infty} c_n\, \frac{\partial^{2n+1} \Phi}{\partial x^{2n+1}}\, \frac{\partial^{2n+1} W}{\partial p^{2n+1}} ,$$

where the $c_n$ are constants. The n = 0 term yields the classical part of Wigner's equation,
whereas all other terms do not provide any contribution, since $W_{eq}$ is quadratic in p. Moreover,
since $W_{eq}$ is a function of the energy alone, it is a stationary solution of the classical
Liouville equation, so that we have finally $\partial W_{eq}/\partial t = 0$. The fact that maximizing the
entropy $S_2$ naturally yields a Wigner function which is both stationary and a function of
the energy alone is in itself remarkable. At the present stage, it is premature to make any
statement about the role of $W_{eq}$, but the subject certainly deserves further attention. For
example, it would be interesting to know if, and under what constraints, $W_{eq}$ can act as an
attractor in a relaxation process.

IV. Smoothed Wigner Functions 

The Wigner function cannot be interpreted as a genuine probability distribution because it
almost always takes negative values. The only pure state whose Wigner function is positive
is given by the minimum uncertainty Gaussian wavepacket:

$$\psi(x) = (2\pi)^{-1/4}\, \sigma^{-1/2} \exp(-x^2/4\sigma^2) , \qquad (28)$$

whose Wigner function is also Gaussian

$$G(x,p) = \frac{1}{\pi\hbar} \exp\left(-\frac{x^2}{2\sigma^2} - \frac{2\sigma^2 p^2}{\hbar^2}\right) . \qquad (29)$$

A possible way to obtain a positive distribution is to smooth a pure Wigner function $W(x,p)$
using a kernel $K(x,p)$ which is itself a Wigner function corresponding to a pure state [?].
The smoothing operation is represented mathematically by a convolution in phase space. The
smoothed Wigner function $\overline{W}(x,p)$

$$\overline{W}(x,p) = \int W(x',p')\, K(x - x', p - p')\, dx'\, dp' = W * K , \qquad (30)$$

is then positive and normalized to unity, so that it can be interpreted as a probability
distribution.



In the past, the most common choice of the smoothing kernel has been the minimum uncertainty
Gaussian $G(x,p)$, as given in Eq. (29) [?]. The resulting smoothed Wigner function
is sometimes referred to as the Husimi function. This choice is however quite arbitrary, and
no argument has ever been proposed, to our knowledge, in order to justify its privileged
status. We shall now prove that smoothing with a Gaussian kernel does have some special
properties, and should therefore be regarded as the correct way to obtain positive smoothed
Wigner functions. In particular, it will be shown that, when the smoothing is performed with
a Gaussian kernel, the result is still an admissible Wigner function.

First of all, we need a precise definition of an admissible Wigner function. Of course, not all
functions of the phase space variables are admissible: for example, those functions which violate
the uncertainty principle are clearly not admissible. Functions that can be constructed
by summing orthogonal pure states, such as $W = \sum_i \alpha_i W_i$, are not admissible if some of
the $\alpha_i$ are negative: this was the example analyzed in Sec. II. Our definition of an admissible
Wigner function is rather standard and is based on the density matrix formalism.
According to standard quantum theory, a density matrix $\rho$ must satisfy three properties in
order to describe a quantum mixed state: (1) it must have unit trace, Tr $\rho$ = 1; (2) it must
be Hermitian, $\rho(x,y) = \rho^*(y,x)$; and (3) its eigenvalues must be non-negative. While the first
two properties are easy to verify, the third is much harder to test, since one would need to
diagonalize $\rho$ in order to compute its eigenvalues. Property (3) can also be expressed in the
following way:

$$\iint \psi(x)\, \rho(x,y)\, \psi^*(y)\, dx\, dy \ge 0 , \quad \forall \psi , \qquad (31)$$

where the inequality must hold for all wavefunctions $\psi$. This makes it even more apparent
that Property (3) cannot be used as an operational test.

Now, the previous properties can be transposed to Wigner functions by making use of the
definition, Eq. (5). In particular we would like to know whether the smoothed Wigner function
$\overline{W}$ is in general admissible or not. Properties (1) and (2) simply require that $\overline{W}$ be real
and normalized to unity. Property (3) can be written in the following form [?]

$$\int \overline{W}(x,p)\, F(x,p)\, dx\, dp \ge 0 , \quad \forall F(x,p) = \text{pure state} . \qquad (32)$$

The equivalence between Eqs. (31) and (32) can be verified by noting that $\overline{W}$ and F are
the Wigner transforms of, respectively, $\overline{\rho}$ and $\psi$, as defined in Eqs. (5)-(6). It is clear that, in
order to check the admissibility of $\overline{W}(x,p)$, one should perform an infinite number of integrals
involving test Wigner functions $F(x,p)$ that represent pure states. However, Eq. (32) can
be used to prove that smoothing with a Gaussian kernel yields a smoothed Wigner function
which is itself admissible.



In order to do so, let us plug Eq. (30) into the left-hand side of Eq. (32). We obtain (W is
the original Wigner function, K is the smoothing kernel, and F is the test function: all three
represent pure states)

$$\int W(x - x', p - p')\, F(x,p)\, K(x',p')\, dx'\, dp'\, dx\, dp$$
$$= \int K(x',p')\, dx'\, dp' \int W_1(x' - x, p' - p)\, F(x,p)\, dx\, dp \qquad (33)$$
$$= \int K(x',p')\, [W_1 * F](x',p')\, dx'\, dp' ,$$

where $W_1(x,p) = W(-x,-p)$ is the Wigner function corresponding to the wavefunction
$\psi(-x)$ [whereas W corresponds to $\psi(x)$]. The term $W_1 * F$ is certainly a positive function,
since it is the convolution product of two Wigner functions. It follows that a sufficient
condition for Eq. (32) to be satisfied is that $K(x,p)$ be positive. But the only pure state
Wigner function which is also positive is the Gaussian $G(x,p)$ [Eq. (29)]. This proves that,
when the smoothing kernel is Gaussian, the inequality given in Eq. (32) is verified, and the
smoothed Wigner function $\overline{W}(x,p)$ is therefore admissible. In this case, the density matrix
$\overline{\rho}$ corresponding to $\overline{W}$ can be written as

$$\overline{\rho} = \int W(x',p')\, |x',p'\rangle \langle x',p'|\, dx'\, dp' , \qquad (34)$$

where $|x',p'\rangle$ denotes the Gaussian wavepacket of Eq. (28) displaced to the phase space point
$(x',p')$. The previous result can be easily checked by computing the Wigner function
$\overline{W}$ associated to $\overline{\rho}$ via Eq. (5), and realizing that it can be written as $\overline{W} = W * G$.
Equation (34) expresses the density matrix as a continuous sum of localized states in phase space
('coherent states' [?]). Note that the coefficients in this sum [i.e. $W(x,p)$ itself] are not
necessarily positive numbers. The reason for this is that the set of coherent states is
'overcomplete', meaning that the representation of an arbitrary quantum state in terms of
coherent states is not unique. However, thanks to the previous theorem, we know that a diagonal
representation of $\overline{\rho}$ with non-negative coefficients does exist, although we are not generally
able to construct it explicitly.

So far we have proven that smoothing with a Gaussian kernel yields a function $\overline{W}$ which is
itself an admissible Wigner function. Nothing definite can be said when the smoothing is
performed using a different kernel. However, we are able to produce a counterexample, i.e.
a pure state Wigner function which, after smoothing with a non-Gaussian kernel, does not
satisfy Eq. (32), and is therefore not admissible. Let us consider the wavefunction

$$\psi(x) = 2\,(2/\pi)^{1/4}\, x \exp(-x^2) , \qquad (35)$$

and call $W(x,p)$ its Wigner transform. Now we smooth W using as kernel W itself:

$$\overline{W} = W * W . \qquad (36)$$

In order to be an admissible Wigner function, $\overline{W}$ must satisfy Eq. (32) for every test function
F. Let us use as test function once again W itself, and compute the integral in Eq. (32). We
obtain (details are in the Appendix)

$$\int \overline{W}(x,p)\, W(x,p)\, dx\, dp = -\frac{1}{27\pi\hbar} < 0 . \qquad (37)$$

This result shows that not all ways of smoothing Wigner functions are equivalent: only by
smoothing with a Gaussian kernel are we certain to obtain a function which is positive and
also represents an admissible quantum state (i.e. a state defined by a density matrix with
real non-negative eigenvalues).
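
The sign of the integral in Eq. (37) can be checked on a grid. The sketch below is an independent numerical illustration, not the calculation of the Appendix. It assumes units with $\hbar = 1$ and uses the closed form $W(x,p) = \pi^{-1}(4x^2 + p^2 - 1)\exp(-2x^2 - p^2/2)$ for the Wigner transform of the state (35), which is the standard result for an odd state of this type and which the code sanity-checks through its normalization and purity. The convolution (36) is approximated by an FFT on a periodic grid whose size and extent are hand-picked.

```python
import numpy as np

N, LX, LP = 256, 6.0, 12.0                # grid size and half-widths (assumed)
x = (np.arange(N) - N // 2) * (2 * LX / N)
p = (np.arange(N) - N // 2) * (2 * LP / N)
dx, dp = x[1] - x[0], p[1] - p[0]
X, P = np.meshgrid(x, p, indexing="ij")

# Assumed closed form of the Wigner transform of the state in Eq. (35), hbar = 1.
W = (4 * X**2 + P**2 - 1) * np.exp(-2 * X**2 - P**2 / 2) / np.pi

def convolve(A, B):
    """Phase space convolution A * B on the periodic grid, computed by FFT."""
    fa = np.fft.fft2(np.fft.ifftshift(A))  # move the origin to index (0, 0)
    fb = np.fft.fft2(np.fft.ifftshift(B))
    return np.fft.fftshift(np.fft.ifft2(fa * fb).real) * dx * dp

W_bar = convolve(W, W)                    # Eq. (36): smoothing W with W itself

norm = W.sum() * dx * dp                  # should be close to 1
purity = 2 * np.pi * (W**2).sum() * dx * dp   # should be close to 1 (pure state)
overlap = (W_bar * W).sum() * dx * dp     # left-hand side of Eq. (37)
print(norm, purity, overlap, -1 / (27 * np.pi))
```

The overlap comes out negative and agrees with $-1/(27\pi)$ for $\hbar = 1$, so the function smoothed with a non-Gaussian kernel fails the test of Eq. (32).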

Furthermore, Eq. (33) suggests another way to construct a phase space distribution which
is both positive and admissible [satisfying Eq. (32)]. Let us take for $W(x,p)$ an arbitrary
positive function of phase space variables, and smooth it with a Gaussian kernel $G(x,p)$:
$\overline{W} = W * G$. We want to prove that $\overline{W}$ is admissible. Equation (32) yields (using the fact
that G is even)

$$\int W(x - x', p - p')\, F(x,p)\, G(x',p')\, dx'\, dp'\, dx\, dp$$
$$= \int W(x',p')\, dx'\, dp' \int G(x' - x, p' - p)\, F(x,p)\, dx\, dp \qquad (38)$$
$$= \int W(x',p')\, [G * F](x',p')\, dx'\, dp' \ge 0 .$$

The result follows from the fact that the convolution product is positive, since both F and
G are pure state Wigner functions, and $W \ge 0$ because we chose it to be so. This proves
that $\overline{W}(x,p)$ is an admissible Wigner function, and is also positive, since it is the convolution
product of two positive functions. The density matrix corresponding to $\overline{W}$ is again $\overline{\rho}$, as
given by Eq. (34). Physically, the smoothed function $\overline{W} = W * G$ can be interpreted as the
admissible quantum state which best approximates the classical state W for a given value of
$\hbar$.



To conclude this Section, we restate the two main results that have been obtained here. We
have shown two possible ways to construct a phase space distribution which is both positive
and an admissible quantum state. This can be performed (a) by smoothing a pure state
Wigner function with a Gaussian kernel, or (b) by smoothing an arbitrary (but positive)
function of phase space variables, again with a Gaussian kernel. Therefore, the Gaussian
function $G(x,p)$ given in Eq. (29) has a privileged status as a smoothing kernel. Note,
however, that G is not unique, since it depends on the parameter $\sigma$.



Although such results were derived for a pure state Wigner function, they can easily be 
generalized to mixtures. It follows that, when smoothing several times with a Gaussian 
kernel, we still remain within the class of admissible Wigner functions. This class is therefore 
closed with respect to this particular operation. 



V. Entropy and Smoothed Wigner Functions 



The smoothing operation has the effect of erasing some of the correlations in the phase 
space. We expect therefore that smoothing should increase the entropy. This is not difficult 
to prove. In order to do this, we need to define the double Fourier transform of a Wigner 
function W(x,p) 

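$$
\widetilde{W}(k,\lambda) = \int\!\!\int W(x,p)\,\exp(-ikx - i\lambda p)\,dx\,dp . \tag{39}
$$

By means of the definition of the Wigner function and Eq. (39), one obtains for a pure state

$$
\widetilde{W}(k,\lambda) = \int \psi^*\!\left(x - \frac{\lambda\hbar}{2}\right)
 \psi\!\left(x + \frac{\lambda\hbar}{2}\right) \exp(-ikx)\,dx . \tag{40}
$$

We can then easily prove the following Lemma

$$
|\widetilde{W}(k,\lambda)|^2 \le
 \int \left|\psi\!\left(x - \frac{\lambda\hbar}{2}\right)\right|^2 dx
 \int \left|\psi\!\left(x + \frac{\lambda\hbar}{2}\right)\right|^2 dx = 1 , \tag{41}
$$

where use has been made of Schwarz's inequality.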



Now, let us take an arbitrary Wigner function W(x,p) and smooth it with a kernel K(x,p)
which is a pure state: $\overline{W}(x,p) = W(x,p) * K(x,p)$. In Fourier space we have:
$\widetilde{\overline{W}}(k,\lambda) = \widetilde{W}(k,\lambda)\,\widetilde{K}(k,\lambda)$. The quantum information
$I[\overline{W}] = 2\pi\hbar \int \overline{W}^2\,dx\,dp$ relative to $\overline{W}$ satisfies the
inequalities



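$$
\begin{aligned}
I[\overline{W}] &= \frac{\hbar}{2\pi}\int |\widetilde{\overline{W}}(k,\lambda)|^2\,dk\,d\lambda
 = \frac{\hbar}{2\pi}\int |\widetilde{W}(k,\lambda)|^2\,|\widetilde{K}(k,\lambda)|^2\,dk\,d\lambda \\
 &\le \max|\widetilde{K}(k,\lambda)|^2 \times \frac{\hbar}{2\pi}\int |\widetilde{W}(k,\lambda)|^2\,dk\,d\lambda \\
 &= \max|\widetilde{K}(k,\lambda)|^2 \times 2\pi\hbar\int W^2\,dx\,dp
 \le 2\pi\hbar\int W^2\,dx\,dp = I[W] ,
\end{aligned}
\tag{42}
$$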



where we have used the previous Lemma [Eq. (41)] for K, as well as Parseval's identity in
the form

$$
\int W^2(x,p)\,dx\,dp = \frac{1}{4\pi^2}\int |\widetilde{W}(k,\lambda)|^2\,dk\,d\lambda . \tag{43}
$$



Equation (42) implies that 



$$
S_2[\overline{W}] \ge S_2[W] , \tag{44}
$$



i.e. the smoothing operation has increased the entropy. Note that, in order to obtain this
result, the smoothing kernel need not be a Gaussian.



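Now we turn to the case where the smoothing kernel is indeed Gaussian. In this case, a
relatively simple expression for $I[\overline{W}]$ can be obtained. The double Fourier transform of the
Gaussian defined in Eq. (29) is

$$
\widetilde{G}(k,\lambda) = \exp\left(-\frac{k^2a^2}{2} - \frac{\lambda^2\hbar^2}{8a^2}\right) . \tag{45}
$$

The Fourier transform of the Wigner function W to be smoothed is given by Eq. (40). Let
us compute the information:

$$
I[\overline{W}] = 2\pi\hbar\int \overline{W}^2\,dx\,dp
 = \frac{\hbar}{2\pi}\int |\widetilde{W}(k,\lambda)|^2\,|\widetilde{G}(k,\lambda)|^2\,dk\,d\lambda . \tag{46}
$$

Expressing $\widetilde{W}$ and $\widetilde{G}$ by means of Eqs. (40) and (45), integrating over k, and
rescaling $\lambda\hbar \to \lambda$, one obtains, after some algebra

$$
I[\overline{W}] = \frac{1}{2a\sqrt{\pi}} \int
 \psi^*\!\left(x - \frac{\lambda}{2}\right) \psi\!\left(x + \frac{\lambda}{2}\right)
 \psi\!\left(x' - \frac{\lambda}{2}\right) \psi^*\!\left(x' + \frac{\lambda}{2}\right)
 \exp\left[-\frac{\lambda^2 + (x-x')^2}{4a^2}\right] dx\,dx'\,d\lambda . \tag{47}
$$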



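We now change the integration variables, using the following linear transformation (whose
Jacobian is equal to 2)

$$
x = w + \tfrac{1}{2}y + \tfrac{1}{2}z , \qquad
x' = w - \tfrac{1}{2}y - \tfrac{1}{2}z , \qquad
\lambda = -y + z . \tag{48}
$$

After some algebra, the following result is obtained

$$
I[\overline{W}] = \frac{1}{a\sqrt{\pi}} \int dw
 \left| \int \psi(w+y)\,\psi(w-y)\,\exp\left(-\frac{y^2}{2a^2}\right) dy \right|^2 , \tag{49}
$$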
which expresses the quantum information in terms of the wavefunction $\psi$ corresponding to
the unsmoothed Wigner function W. Equation (49) may be usefully employed to monitor the
time evolution of the entropy in a numerical simulation: $\psi(x, t)$ would then evolve according
to the time-dependent Schrödinger equation.
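
As a hedged illustration of how Eq. (49) might be evaluated in practice, the sketch below computes the double integral by simple quadrature for a wavefunction supplied as a callable. It assumes the normalisation I = (a√π)⁻¹ ∫ dw |∫ ψ(w+y) ψ(w−y) exp(−y²/2a²) dy|², which reproduces the Gaussian result I = z/(1+z²) with z = a/μ derived later in this Section, and the relation S₂ = 1 − I. The names `smoothed_information` and `gaussian_wavefunction`, as well as the grid, are hypothetical choices for this example.

```python
import numpy as np

def smoothed_information(psi, grid, a):
    """Quadrature estimate of Eq. (49) for a wavefunction given as a callable.

    Assumes the prefactor 1/(a*sqrt(pi)) and a uniform grid wide enough
    to contain the wavefunction.
    """
    step = grid[1] - grid[0]
    w = grid[:, None]          # outer integration variable
    y = grid[None, :]          # inner integration variable
    integrand = psi(w + y) * psi(w - y) * np.exp(-y**2 / (2.0 * a**2))
    inner = integrand.sum(axis=1) * step
    return float(np.sum(np.abs(inner) ** 2) * step / (a * np.sqrt(np.pi)))

def gaussian_wavefunction(mu):
    """Real Gaussian wavefunction whose density |psi|^2 has standard deviation mu."""
    norm = (2.0 * np.pi * mu**2) ** -0.25
    return lambda x: norm * np.exp(-x**2 / (4.0 * mu**2))

grid = np.linspace(-10.0, 10.0, 801)
mu, a = 1.0, 2.0
info = smoothed_information(gaussian_wavefunction(mu), grid, a)
entropy = 1.0 - info           # S_2 = 1 - I
z = a / mu
print(info, z / (1.0 + z**2), entropy)
```

In a time-dependent simulation the callable would be replaced by an interpolant of the wavefunction stored on the numerical grid at each time step; that step is not shown here.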



Finally, we show that a more stringent bound than the one expressed by Eq. (44) can be
obtained when the smoothing kernel is Gaussian. By using again Schwarz's inequality, we
have from Eq. (49)

$$
\begin{aligned}
I[\overline{W}] &\le \frac{1}{a\sqrt{\pi}} \int dw \int dy\,\exp\left(-\frac{y^2}{a^2}\right)
 \int dy'\,|\psi(w+y')\,\psi(w-y')|^2 \\
 &= \int dw \int dy'\,|\psi(w+y')\,\psi(w-y')|^2 = \frac{1}{2} .
\end{aligned}
\tag{50}
$$
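
The two simplifications used in Eq. (50) can be made explicit. The worked step below is a sketch: the auxiliary variables s and t are introduced only for this purpose, and the wavefunction is assumed to be normalized to unity.

```latex
\int_{-\infty}^{\infty} \exp\left(-\frac{y^2}{a^2}\right) dy = a\sqrt{\pi} ,
\qquad
\int dw \int dy'\, |\psi(w+y')|^2\, |\psi(w-y')|^2
 = \frac{1}{2} \int ds\, |\psi(s)|^2 \int dt\, |\psi(t)|^2 = \frac{1}{2} ,
\qquad
s = w + y' ,\quad t = w - y' ,\quad dw\,dy' = \tfrac{1}{2}\, ds\, dt .
```

The Gaussian integral cancels the prefactor $1/(a\sqrt{\pi})$, and the factor 1/2 is the Jacobian of the change of variables.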



In terms of the entropy, this becomes 



$$
S_2[\overline{W}] \ge \frac{1}{2} , \tag{51}
$$



a result that is valid when smoothing a pure Wigner function with a Gaussian kernel. Note
that we still have some freedom in the choice of the kernel, since the width a of the Gaussian
in Eq. (29) is still unspecified. It would be interesting, for example, to know which value of a
minimizes the entropy $S_2[\overline{W}]$, within the bounds given by Eq. (51). We have not been able
to obtain a general result, but some indication can be obtained from the following example.
Let us suppose that the function W to be smoothed is also a Gaussian, as in Eq. (29), but
with spatial variance $\mu^2$ instead of $a^2$. The smoothed Wigner function is then

$$
\overline{W} = W * G = \frac{1}{2\pi\sigma_x\sigma_p}
 \exp\left(-\frac{x^2}{2\sigma_x^2} - \frac{p^2}{2\sigma_p^2}\right) , \tag{52}
$$

with

$$
\sigma_x^2 = a^2 + \mu^2 , \qquad
\sigma_p^2 = \frac{\hbar^2}{4}\left(\frac{1}{a^2} + \frac{1}{\mu^2}\right) .
$$

The information corresponding to $\overline{W}$ is

$$
I[\overline{W}] = 2\pi\hbar\int \overline{W}^2\,dx\,dp = \frac{\hbar}{2\sigma_x\sigma_p} . \tag{53}
$$

After some algebra, one obtains the following expression

$$
I[\overline{W}] = I(z) = \frac{z}{1+z^2} , \tag{54}
$$

where $z = a/\mu$. The function I(z) attains its maximum for z = 1, i.e. when $a = \mu$, and
the kernel has the same variance as the Wigner function to be smoothed. In this case,
$S_2[\overline{W}] = 1/2$, which represents the lower bound of Eq. (51). We could conjecture, although
we do not have a formal proof, that this is the general result: the minimum entropy increase
due to smoothing with a Gaussian kernel is attained when the width of the kernel is close to
the width of the function to be smoothed.
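
The algebra leading to Eq. (54) is short enough to be written out. The following worked step is a sketch, assuming the smoothed Gaussian has variances $\sigma_x^2 = a^2 + \mu^2$ and $\sigma_p^2 = (\hbar^2/4)(1/a^2 + 1/\mu^2)$ and that $I[\overline{W}] = \hbar/(2\sigma_x\sigma_p)$:

```latex
\sigma_x^2\,\sigma_p^2
 = \frac{\hbar^2}{4}\,(a^2+\mu^2)\left(\frac{1}{a^2}+\frac{1}{\mu^2}\right)
 = \frac{\hbar^2}{4}\,\frac{(a^2+\mu^2)^2}{a^2\mu^2}
\;\Longrightarrow\;
I[\overline{W}] = \frac{\hbar}{2\sigma_x\sigma_p}
 = \frac{a\mu}{a^2+\mu^2} = \frac{z}{1+z^2} ,
\qquad
\frac{dI}{dz} = \frac{1-z^2}{(1+z^2)^2} = 0
\;\Longrightarrow\; z = 1 ,\quad I(1) = \frac{1}{2} .
```

Since I(z) is invariant under $z \to 1/z$, a kernel that is too narrow and one that is too wide by the same factor produce the same entropy increase.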

Another interesting example is provided by the harmonic oscillator, whose Hamiltonian is

$$
H(x,p) = \frac{p^2}{2m} + \frac{1}{2}m\omega^2x^2 . \tag{55}
$$

The eigenstates can be expressed in terms of Hermite polynomials $H_n(\xi)$ ($H_0 = 1$, $H_1 = 2\xi$,
$H_2 = 4\xi^2 - 2$, ...)

$$
\psi_n(x) = (2^n n!)^{-1/2} \left(\frac{m\omega}{\pi\hbar}\right)^{1/4}
 \exp\left(-\frac{m\omega x^2}{2\hbar}\right) H_n\!\left(x\sqrt{m\omega/\hbar}\right) . \tag{56}
$$

The corresponding Wigner functions are

$$
W_n(x,p) = \frac{(-1)^n}{\pi\hbar} \exp\left(-\frac{2H}{\hbar\omega}\right)
 L_n\!\left(\frac{4H}{\hbar\omega}\right) , \tag{57}
$$

where H(x,p) is the Hamiltonian, and the $L_n(\xi)$ are Laguerre polynomials ($L_0 = 1$, $L_1 = 1 - \xi$,
$L_2 = 1 - 2\xi + \xi^2/2$, ...). We now smooth such Wigner functions with a Gaussian kernel,
and find

$$
\overline{W}_n(x,p) = (2\pi\hbar\,n!)^{-1}\,(H/\hbar\omega)^n \exp(-H/\hbar\omega) . \tag{58}
$$

Note that this relatively simple result for $\overline{W}_n$ is obtained only in the case when the
variance of the smoothing kernel [see Eq. (29)] is $a^2 = \hbar/2m\omega$; in all other cases the smoothed
Wigner function is not a function of the energy only. We are now in a position to compute
the information $I[\overline{W}_n] \equiv I_n = 2\pi\hbar\int \overline{W}_n^2\,dx\,dp$. Let us first change to polar coordinates $(r, \theta)$
in the phase space

$$
\frac{p^2}{m} + m\omega^2x^2 = \hbar\omega r^2 , \qquad dx\,dp = \hbar\,r\,dr\,d\theta . \tag{59}
$$

One obtains, after integration over $\theta$

$$
I_n = (n!)^{-2} \int_0^\infty (r^2/2)^{2n} \exp(-r^2)\,r\,dr , \tag{60}
$$

and finally, changing variable again to $z = r^2/2$

$$
I_n = (n!)^{-2} \int_0^\infty z^{2n} \exp(-2z)\,dz = \frac{(2n)!}{2^{2n+1}(n!)^2} . \tag{61}
$$

We first note that $I_0 = 1/2$, in agreement with previous results, since the ground state of the
harmonic oscillator is a Gaussian, and we are smoothing with another Gaussian of identical
width. It can also be shown that $I_n$ is a decreasing function of n. The asymptotic expansion
(for $n \gg 1$) is obtained by taking the logarithm of Eq. (61) and making use of Stirling's
formula

$$
\ln X! \sim X\ln X - X + \tfrac{1}{2}\ln X \qquad (X \gg 1) ,
$$

which yields

$$
I_n \sim n^{-1/2} . \tag{62}
$$

In terms of the entropy, we have in summary

$$
S_2[\overline{W}_0] = 1/2 , \qquad
S_2[\overline{W}_{n+1}] > S_2[\overline{W}_n] , \qquad
\lim_{n\to\infty} S_2[\overline{W}_n] = 1 . \tag{63}
$$

The latter result means that the entropy increase is larger when smoothing a semi-classical
state. Asymptotically, the entropy of the smoothed Wigner function approaches unity. On
the other hand, when smoothing a 'fully quantum' state (i.e. a state with small quantum
numbers), the entropy increase is moderate. Although these results were obtained for the
special case of the harmonic oscillator, we are confident that they remain qualitatively correct
for other (classically integrable) Hamiltonians.
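
The closed form $I_n = (2n)!/[2^{2n+1}(n!)^2]$ of Eq. (61) is easy to tabulate exactly. The short sketch below does so with rational arithmetic and checks the three statements summarized in Eq. (63). It assumes $S_2 = 1 - I_n$; the comparison value $1/(2\sqrt{\pi n})$ is the standard large-n estimate obtained when the full Stirling formula is used, and the function name is a hypothetical choice.

```python
from fractions import Fraction
from math import factorial, pi, sqrt

def smoothed_information(n):
    """Exact value of I_n = (2n)! / (2^(2n+1) (n!)^2), as in Eq. (61)."""
    return Fraction(factorial(2 * n), 2 ** (2 * n + 1) * factorial(n) ** 2)

values = [smoothed_information(n) for n in range(200)]
entropies = [1 - v for v in values]          # S_2 = 1 - I_n

# Ratio to the large-n estimate 1 / (2 sqrt(pi n)); it should approach 1.
ratio_100 = float(values[100]) * 2.0 * sqrt(pi * 100)

for n in (0, 1, 2, 3, 100):
    print(n, float(values[n]), float(entropies[n]))
print(ratio_100)
```

The first few values are 1/2, 1/4, 3/16 and 5/32, so the entropy of the smoothed state rises from 1/2 for the ground state towards unity as n grows.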

VI. Discussion 

In this paper we have presented several results related to a new definition of quantum entropy,
denoted $S_2$. Although it has already been used in the past in the framework of the density
matrix formalism, such entropy becomes particularly interesting when applied to Wigner
functions. It is then possible to show that $S_2$ possesses a number of interesting properties:
most importantly, it is an invariant for the Wigner equation, which governs
the evolution of Wigner functions. $S_2$ is related to the Tsallis entropy, although the latter is
usually defined for a discrete set of probabilities, rather than for a continuous distribution.
An advantage of this entropy, compared to the quantum von Neumann entropy, is that the
Wigner function is all that one needs to compute $S_2$. No knowledge of the density matrix is
required, nor does it need to be diagonalized, as is the case for the von Neumann entropy.

The standard properties of entropy (concavity, additivity, sub-additivity) have been examined.
This has revealed some interesting facts, which would require further investigation.
For instance, it has been proven that $S_2$ (unlike ordinary entropy) behaves like a probability
with respect to additivity properties, which is also consistent with the normalization
$0 \le S_2 \le 1$. Secondly, the analysis of the canonical ensemble has enabled us to derive a
Wigner function $W_{eq}$ that maximizes the entropy under certain constraints. $W_{eq}$ turns out
to be both a function of the energy alone and a stationary solution of the Wigner equation.
The relevance of $W_{eq}$ is still unclear, but one could reasonably conjecture that it plays a role
in some relaxation processes. Numerical experiments could clarify this point.

An "unpleasant" property of S2 is that, keeping the Wigner function fixed, and letting 
Planck's constant go to zero, one obtains $S_2 = 1$. Thus it would seem that all classical
states have unit entropy. The point is that this is not the correct procedure to obtain a
classical state: indeed, if the original Wigner function is negative somewhere, we would obtain
a classical state with a non-positive probability distribution, which is of course meaningless.
The correct procedure is instead to smooth the Wigner function W with an appropriate
kernel, which must also be a Wigner function in order to ensure positivity. A crucial point,
however, is that the smoothed Wigner function $\overline{W}$ should be itself an admissible quantum
state, i.e. one that can be described by a density matrix with non-negative eigenvalues. We
have been able to prove that, when smoothing with a minimum uncertainty Gaussian packet,
the result is always admissible, although this is not necessarily the case when smoothing
with another Wigner function. This is, to our knowledge, the first rigorous argument showing
that Gaussian smoothing possesses some privileged status.

It has also been proven that smoothing increases the entropy: in particular, when smoothing
a pure state with a Gaussian kernel, one has $S_2[\overline{W}] \ge 1/2$. It would be interesting to know
how to minimize $S_2[\overline{W}]$. This could be done by varying the width a of the Gaussian kernel,
which is still a free parameter. Although we are not able to derive a rigorous result, we have
conjectured (and shown explicitly on a particular example) that $S_2[\overline{W}]$ is minimum when the
width of the Gaussian kernel is close to the width of the Wigner function to be smoothed.
This would not be unreasonable from the information point of view: it would mean that
we can minimize the entropy increase if we have some prior knowledge of the function to be
smoothed.
As a further example, we have computed the entropy of the (smoothed) stationary states
of the harmonic oscillator. It was shown that $S_2$ increases with quantum number, therefore
semi-classical states yield a larger entropy increase. Again, we have conjectured that this
behavior is universal (at least for confining and classically integrable Hamiltonians), and not
specific to the harmonic oscillator. We are rather confident that our conjecture is correct
since the larger entropy increase for semi-classical states is mainly due to the fact that their
Wigner function displays short-wavelength oscillations in the phase space, which are easily
erased by the smoothing procedure.

It would be interesting to know how the previous results generalize to classically
non-integrable Hamiltonians. For the harmonic oscillator, it was found that the information
of the smoothed stationary states behaves as $I_n \sim n^{-1/2}$. Although the exponent −1/2
might be specific to the harmonic oscillator, a power law may be universal for the class
of integrable Hamiltonians. On the other hand, one could conjecture that, for non-integrable
Hamiltonians, the decrease is faster, perhaps exponential.



From the physical point of view, this result means that semi-classical states are highly
unstable under generic perturbations (amongst which smoothing is a relevant example). This is
reminiscent of the so-called 'predictability sieve', a concept introduced by W.H. Zurek and
co-workers in the more general framework of decoherence [12, 13]. Zurek et al. construct
a model for the interaction of a quantum system with an environment at thermodynamic
equilibrium, and compute the rate at which initially pure states deteriorate into mixtures by
coupling with the environment. This process is known as decoherence. Subsequently, they
look for the set of states which are least prone to deterioration, and find that such states are
those which yield the minimum entropy increase. By estimating the entropy production, they
obtain that the minimum entropy increase is attained for the ground state of the harmonic
oscillator, i.e. a minimum uncertainty Gaussian wavepacket. This coincides with our results
of Sec. V.



The main difference from our approach is that W.H. Zurek and co-workers analyze a
dynamical situation, while in our case the entropy-producing effect is the smoothing, which is
a static process. Since both cases appear to give the same result, it is reasonable to conjecture
that smoothing may represent a (simplified) model for the interaction of a quantum system
with an open environment. The price to pay for our approach is that we do not have a
first-principles derivation of such an interaction. The advantage is that the model is
simple enough to obtain a number of rigorous results.

These considerations may shed some new light on the semi-classical limit. We distinguish two
kinds of pure quantum states: fully quantum (FQ) states $W_{FQ}$ (with low quantum numbers),
and semi-classical (SC) states $W_{SC}$ (with large quantum numbers). For both $S_2 = 0$, i.e.
they contain the same amount of information. However, after the smoothing, one obtains
$S_2[\overline{W}_{FQ}] \simeq 1/2$ and $S_2[\overline{W}_{SC}] \simeq 1$, i.e. the smoothed FQ state contains more information
than the smoothed SC state. In other words, although both original states contain the
same information, this is of different 'quality': robust for the FQ state, and highly prone to
deterioration for the SC state. It is not surprising, therefore, that coupling to an environment
has the effect of erasing such information less easily in the former case than in the latter. These
results could open new avenues for further research, particularly with computer experiments,
to investigate the dynamical behavior of the entropy defined in this paper.
Appendix

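We want to prove the result of Eq. (37). Let us use the identity

$$
\int f(x,p)\,g(x,p)\,dx\,dp = \frac{1}{4\pi^2}\int \widetilde{f}(k,\lambda)\,\widetilde{g}^*(k,\lambda)\,dk\,d\lambda ,
$$

with $f = \overline{W}$ and $g = W$. Since $\overline{W}(x,p) = W * W$, the double Fourier transform of $\overline{W}$ is
$\widetilde{W}^2(k,\lambda)$. In addition, it will turn out that $\widetilde{W}(k,\lambda)$ is real for the case under consideration
here. Therefore, by making use of the previous identity, the left-hand side of Eq. (37) becomes

$$
\int \overline{W}(x,p)\,W(x,p)\,dx\,dp = \frac{1}{4\pi^2}\int \widetilde{W}^3(k,\lambda)\,dk\,d\lambda .
$$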

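The double Fourier transform $\widetilde{W}(k,\lambda)$ is given by Eq. (40). For our example, the
wavefunction is the one of Eq. (35), and we obtain

$$
\widetilde{W}(k,\lambda) = 4\sqrt{2/\pi}\,\exp(-\lambda^2\hbar^2/2)
 \int \left(x^2 - \frac{\lambda^2\hbar^2}{4}\right) \exp(-2x^2)\,\exp(-ikx)\,dx .
$$

Now, by using the following integrals

$$
\int \exp(-2x^2)\,\exp(-ikx)\,dx = \sqrt{\frac{\pi}{2}}\,\exp(-k^2/8) ,
$$

$$
\int x^2 \exp(-2x^2)\,\exp(-ikx)\,dx = \frac{1}{4}\sqrt{\frac{\pi}{2}}\left(1 - \frac{k^2}{4}\right)\exp(-k^2/8) ,
$$

we obtain, after some straightforward algebra

$$
\widetilde{W}(k,\lambda) = \left(1 - \frac{k^2}{4} - \lambda^2\hbar^2\right)
 \exp\left(-\frac{k^2}{8} - \frac{\lambda^2\hbar^2}{2}\right) .
$$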

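We are now ready to compute the integral $\int \widetilde{W}^3\,dk\,d\lambda$. Let us change integration variables
$(k,\lambda) \to (r,\varphi)$

$$
r^2 = \frac{k^2}{4} + \lambda^2\hbar^2 ; \qquad r\,dr\,d\varphi = \frac{\hbar}{2}\,d\lambda\,dk .
$$

After integration over $\varphi$, one obtains

$$
\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \widetilde{W}^3\,dk\,d\lambda
 = \frac{4\pi}{\hbar}\int_0^\infty (1-r^2)^3 \exp\left(-\frac{3}{2}r^2\right) r\,dr .
$$

Changing the integration variable to $y = r^2$ and using integrals of the type

$$
\int_0^\infty y^n \exp(-ay)\,dy = \frac{n!}{a^{n+1}} ,
$$

one obtains

$$
\int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \widetilde{W}^3\,dk\,d\lambda
 = \frac{2\pi}{\hbar}\int_0^\infty (1-y)^3 \exp\left(-\frac{3y}{2}\right) dy = -\frac{4\pi}{27\hbar} ,
$$

which, once divided by $4\pi^2$, yields the result of Eq. (37).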
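
The last integral can be verified with exact rational arithmetic. The sketch below expands $(1-y)^3$ and applies $\int_0^\infty y^n \exp(-ay)\,dy = n!/a^{n+1}$ term by term with a = 3/2. The helper name `moment` is a hypothetical choice, and the factors of $\pi$ and $\hbar$ are tracked by hand in the comments rather than in the code.

```python
from fractions import Fraction
from math import factorial

def moment(n, a):
    """Integral of y^n exp(-a y) over (0, infinity), equal to n! / a^(n+1)."""
    return Fraction(factorial(n)) / a ** (n + 1)

a = Fraction(3, 2)
coefficients = [1, -3, 3, -1]        # expansion of (1 - y)^3 in powers of y

integral = sum(c * moment(n, a) for n, c in enumerate(coefficients))

# Prefactor 2*pi/hbar, then division by 4*pi^2: the result is (coefficient) / (pi * hbar).
coefficient = Fraction(2) * integral / 4

print(integral)      # value of the y-integral
print(coefficient)   # multiplies 1 / (pi * hbar)
```

The y-integral equals −2/27, so that the final coefficient is −1/27, i.e. $\int \overline{W}\,W\,dx\,dp = -1/(27\pi\hbar)$: a negative value.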